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Abstract. Mastermind is a popular board game released in 1971, where 
a codemaker chooses a secret pattern of colored pegs, and a codebreaker 
has to guess it in several trials. After each attempt, the codebreaker 
gets a response from the codemaker containing some information on the 
number of correctly guessed pegs. The search space is thus reduced at 
each turn, and the game continues until the codebreaker is able to find 
the correct code, or runs out of trials. 

In this paper we study several variations of ^^MSP, the problem of com- 
puting the size of the search space resulting from a given (possibly fic- 
titious) sequence of guesses and responses. Our main contribution is a 
proof of the #P-completeness of #MSP under parsimonious reductions, 
which settles an open problem posed by Stuckman and Zhang in 2005, 
concerning the complexity of deciding if the secret code is uniquely de- 
termined by the previous guesses and responses. Similarly, #MSP stays 
#P-complete under Turing reductions even with the promise that the 
search space has at least k elements, for any constant k. (In a regular 
game of Mastermind, k = 1.) 

All our hardness results hold even in the most restrictive setting, in which 
there are only two available peg colors, and also if the codemaker's re- 
sponses contain less information, for instance like in the so-called single- 
count (black peg) Mastermind variation. 
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1 Introduction 

Mastermind at a glance. Mastermind is a code-breaking board game 
released in 1971, which sold over 50 million sets in 80 countries. The Israeli 
postmaster and telecommunication expert Mordecai Meirowitz is usually 
credited for inventing it in 1970, although an almost identical paper-and- 
pencil game called bulls and cows predated Mastermind, perhaps by more 
than a century [1]. 

The classic variation of the game is played between a codemaker, who 
chooses a secret sequence of four colored pegs, and a codebreaker, who 
tries to guess it in several attempts. There are six available colors, and 



Fig. 1. A Mastermind box publislied by Pressman Toy Corporation in 1981, foreshad- 
owing the game's computational hardness. 



the secret code may contain repeated colors. After each attempt, the 
codebreaker gets a rating from the codemaker, consisting in the number 
of correctly placed pegs in the last guess, and the number of pegs that have 
the correct color but are misplaced. The rating does not tell which pegs 
are correct, but only their amount. These two numbers are communicated 
by the codemaker as a sequence of smaller black pegs and white pegs, 
respectively (see Figure 1 , where the secret code is concealed behind a 
shield, and each guess is paired with its rating). If the codebreaker's last 
guess was wrong, he guesses again, and the game repeats until the secret 
code is found, or the codebreaker reaches his limit of ten trials. Ideally, 
the codebreaker plans his new guesses according to the information he 
collected from the previous guesses. Table 1 depicts a complete game of 
Mastermind, where colors are encoded as numbers between zero and five, 
and the codebreaker finally guesses the code at his sixth attempt. 



Table 1. A typical game of Mastermind. 
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Previous work. Recently, Focardi and Luccio pointed out the unex- 
pected relevance of Mastermind in real- life security issues, by showing 
how certain API-level bank frauds, aimed at disclosing user PINs, can be 
interpreted as an extended Mastermind game played between an insider 
and the bank's computers [S]. On the other hand, Goodrich suggested 
some applications to genetics of the Mastermind variation in which scores 
consist of black pegs only, called single-count (black peg) Mastermind [6]. 

As a further generalization of the original game, we may consider 
(n, c) -Mastermind, where the secret sequence consists of n pegs, and there 
are c available colors. Chvatal proved that the codebreaker can always de- 
termine the secret code in (n, c)-Mastermind after at most 2n log2 c+4n + 

guesses, each computable in polynomial time, via a simple divide-and- 
conquer strategy [3]. This upper bound was later lowered by a constant 
factor in [2] , while Goodrich also claimed to be able to lower it for single- 
count (black peg) Mastermind, hence using even less information [6]. Un- 
fortunately, after a careful inspection, Goodrich's method turns out to 
outperform Chvatal's several techniques given in [3] asymptotically (as n 
grows, and c is a function of n) only if n^~^ < c < (3 + e)nlog2n, for 
every e > 0. 

However, despite being able to guess any secret code with an efficient 
strategy, the codebreaker may insist on really minimizing the number of 
trials, either in the worst case or on average. Knuth proposed a heuristic 
that exhaustively searches through all possible guesses and ratings, and 
greedily picks a guess that will minimize the number of eligible solutions, 
in the worst case [8j . This is practical and worst-case optimal for standard 
(4, 6)-Mastermind, but infeasible and suboptimal for even slightly bigger 
instances. The size of the solution space is employed as an ideal quality 
indicator also in other heuristics, most notably those based on genetic 
algorithms [7]. 

In order to approach the emerging complexity theoretic issues, Stuck- 
man and Zhang introduced the Mastermind Satisfiability Problem 
(MSP) for (n, c)-Mastermind, namely the problem of deciding if a given 
sequence of guesses and ratings has indeed a solution, and proved its 
NP-completeness |10j . Similarly, Goodrich showed that also the anal- 
ogous satisfiability problem for single-count (black peg) Mastermind is 
NP-complete [6j. 

Interestingly, Stuckman and Zhang observed that the problem of de- 
tecting MSP instances with a unique solution is Turing-reducible to the 
problem of producing an eligible solution. However, the determination of 
the exact complexity of the first problem is left open |lUj . 



Our contribution. In this paper we study #MSP, the counting prob- 
lem associated with MSP, i.e., the problem of computing the number of 
solutions that are compatible with a given set of guesses and ratings. We 
do this for standard (n, c)-Mastermind, as well as its single-count varia- 
tion with only black peg ratings, and the analogous single-count variation 
with only white peg ratings, both in general and restricted to instances 
with a fixed number of colors c. 

Our main theorem states that, in all the aforementioned variations of 
Mastermind, ^^MSP is either trivially polynomial or #P-complete under 
parsimonious reductions. Capturing the true complexity of ^^MSP is an 
improvement on previous results (refer to |6I10] ) because: 

• Evaluating the size of the search space is a natural and recurring 
subproblem in several heuristics, whereas merely deciding if a set of 
guesses has a solution seems a more fictitious problem, especially be- 
cause in a real game of Mastermind we already know that our previous 
guesses and ratings do have a solution. 

• The reductions we give are parsimonious, hence they yield stronger 
versions of all the previously known NP-completeness proofs for MSP 
and its variations. Moreover, we obtain the same hardness results 
even for (n, 2)-Mastermind, whereas all the previous reductions used 



unboundedly many colors (see Corollary 1 ) . 

Our main theorem enables simple proofs of a wealth of complexity- 
related corollaries, including the hardness of detecting unique solu- 



tions, which was left open in jlOj (see Corollary 3). 



Paper structure. In Section 2] we define #MSP and its variations. 



Section 3 contains a statement and proof of our main result, Theorem 1 



and an example of reduction. In Section 4 we apply Theorem 1 to several 



promise problems with different assumptions on the search space, and 



finally in Section 5 we suggest some directions for further research. 



2 Definitions 



Codes and ratings. For (n, c)-Mastermind, let the set be the code 
space, whose elements are codes of n numbers ranging from to c — 1. 
Following Chvatal, we define two metrics on the code space |3j. If x = 
{xi, • • • , Xn) and y = (yi, • • • , ?/„) are two codes, let a{x, y) be the number 
of subscripts i with Xi = yi, and let f3{x,y) be the largest a{x,y), with y 
running through all the permutations of y. As observed in n — a{x, y) 
and n — (3{x,y) are indeed distance functions, respectively on and 



"^c / Sn (i.e., the code space where codes are equivalent up to reordering 
of their elements). 

Given a secret code s G chosen by the codemaker, we define the 
rating of a guess g G Z", for all the three variants of Mastermind we want 
to model. 

— For standard Mastermind, let p{s,g) = {a{s, g), [3{s, g) — a{s,g)). 

— For single-count black peg Mastermind, let pb{s,g) = a{s,g). 

— For single-count white peg Mastermind, let Pw{s,g) = /3{s,g). 

A guess is considered correct in single-count white peg (n, c)-Mastermind 
whenever its rating is n, therefore the secret code has to be guessed only 
up to reordering of the numbers. As a consequence, the codebreaker can 
always guess the code after c — 1 attempts: He can determine the number 
of pegs of each color via monochromatic guesses, although this is not 
an optimal strategy when c outgrows n. On the other hand, order does 
matter in both other variants of Mastermind, where the guess has to 
coincide with the secret code for the codebreaker to win. 

Satisfiability problems. Next we define the Mastermind Satisfia- 
bility Problem for all three variants of Mastermind. 

Problem 1. MSP (respectively, MSP-Black, MSP-White). 

Input: (n, c, Q), where Q is a finite set of queries of the form (g, r), where 

g G and r is a rating. 

Output: Yes if there exists a code x £ TL^ such that r = g^ (respec- 
tively, r = ph{x,g), r = pw{x,g)) for all {g,r) G Q. No otherwise. 

MSP and MSP-Black are known to be NP-complete problems [ 6|10j . 
We shall see in [Corollary l| how MSP-White is NP-complete, as well. 

Further, we may want to restrict our attention to instances of Mas- 
termind with a fixed number of colors. Thus, for every constant c, let 
(c)-MSP be the restriction of MSP to problem instances with exactly c 
colors (i.e., whose input is of the form (-,0, •)). Similarly, we define (c)- 
MSP-Black and (c)-MSP- White. 

Counting problems. All the above problems are clearly in NP, thus it 
makes sense to consider their counting versions, namely #MSP, #MSP- 
Black, 7^(c)-MSP, and so on, which are all ^^P problems [11]. Basically, 
these problems ask for the size of the solution space after a number of 
guesses and ratings, i.e., the number of codes that are coherent with all 
the guesses and ratings given as input. 



Recall that reductions among #P problems that are based on or- 
acles are called Turing reductions and are denoted with while the 
more specific reductions that map problem instances preserving the num- 
ber of solutions are called parsimonious reductions, and are denoted with 
^pars- Each type of reduction naturally leads to a different notion of 
#P-completeness: For instance, is #P-complete under Turing 

reductions, while is #P-complete under parsimonious reduc- 

tions [9]. Problems that are #P-complete under parsimonious reductions 
are a fortiori NP-complete, while it is unknown whether all NP-complete 
problems are #P-complete, even under Turing reductions [1]. 



3 Counting Mastermind solutions 

Next we give a complete classification of the complexities of all the count- 



ing problems introduced in Section 2 
Theorem 1. 

a) #MSP, #MSP-Black and #MSP- White are i^Y* -complete under 
parsimonious reductions. 

b) 7^(c)-MSP and #(c)-MSP-Black are -complete under parsimo- 
nious reductions for every c ^ 2 . 

c) ^(c) -MSP- White is solvable in deterministic polynomial time for 
every c ^ 1. 

(Notice that #(1)-MSP and #(1)-MSP-Black are trivially solvable in 
deterministic linear time.) 

Lemma 1. For every 1, #(c)-MSP-White is solvable in determin- 
istic polynomial time. 

Proof. In TLc I Sn there are only ("'^^^^) = Q{nP~^) possible codes to check 
against all the given queries, hence the whole process can be carried out 
in polynomial time, for any constant c. □ 

Lemma 2. For every c ^ 1, 

#(c)-MSP ^pars #(C+1)-MSP, 

#(c)-MSP-Black ^pars # (c + 1) -MSP-Black. 

Proof. Given the instance {n,c,Q) of #(c)-MSP (respectively, #(c)- 
MSP-Black), we convert it into {n,c+ 1,QU {{g,r)}), where g is a 
sequence of n consecutive c's, and r = (0,0) (respectively, r = 0). The 
new query (g, r) implies that the new color c does not occur in the secret 
code, hence the number of solutions is preserved and the reduction is in- 
deed parsimonious. □ 



Lemma 3. #3-SAT ^pars #MSP- White. 



Proof. Given a 3-CNF Boolean formula (p with v variables and m clauses, 
we map it into an instance of MSP-White {n,c,Q). For each clause Cj 
of we add three fresh auxiliary variables Oj, bi, Ci. For each variable x 
(including auxiliary variables), we define two colors x and x, representing 
the two possible truth assignments for x. We further add the mask color 
*, thus getting c = 2v + 6m + 1 colors in total. We let n = u + 3m (we 
may safely assume that n ^ 5), and we construct Q as follows. 



1) Add the query *), Oj . 

2) For each variable x, add the query ^(x, x, x, x, *,*,*,•••,*), 1 

3) For each clause Ci = {^1,^25^3} (where each literal may be positive 
or negative), add the query ^(^1, ^2, ^3, ^i, bi, *,*,*,•••,*), 3^ . 

4) For each clause Cj, further add the query ( (a^, 6j, q, *,*,*,•••,*), 2 ) . 



By (1) , the mask color does not occur in the secret code; by (2) , each 
variable occurs in the secret code exactly once, either as a positive or 
a negative literal. Moreover, by |(3)[ at least one literal from each clause 
must appear in the secret code. Depending on the exact number of literals 
from Ci that appear in the code (either one, two or three), the queries 
in 1(3)1 and 1(4)1 always force the values of the auxiliary variables Oj, bi and 
Cj. (Notice that, without [(4)| there would be two choices for Oj and bi, in 
case exactly two literals of Ci appeared in the code.) As a consequence, 
the reduction is indeed parsimonious. □ 

Lemma 4. #3-SAT ^pars #(2)-MSP-Black. 



Proof. We proceed along the lines of the proof of Lemma 3 with similar 
notation. We add the same auxiliary variables Oj, bi, Ci for each clause 
Ci, and we construct the instance of (2)-MSP-Black {2n,2,Q), where 
n = V + 3m. This time we encode literals as positions in the code: For 
each variable x, we allocate two specific positions x and x, so that gx = 1 
(respectively, gx = 1) in code g = (51, • • • ,g2n) if and only if variable x is 
assigned the value true (respectively, false). Notice that, in contrast with 



|Lemma"3 we are not using a mask color here. Q is constructed as follows. 



1) Add the query (^(0, 0, 0, • • • , 0), nj . 

2) For each variable x, add the query {g, n), where gj = 1 if and only if 
j e {x,x}. 



3) For each clause Ci = {ii,i2, ^3}, add the query (g, n + 1), where gj = 1 
if and only ifj G {ii,i2,£3,ai,bi}. (Without loss of generality, we may 
assume that £1, £2 and £3 are occurrences of three mutually distinct 
variables [H].) 

4) For each clause Ci, further add the query {g,n + 1), where gj = 1 if 
and only if j € {a^, 6j, q}. 

By |(l)| every solution must contain n times and n times 1, in some order. 



The semantics of (2) , (3) and | (4) | is the same as that of the corresponding 



steps in Lemma 3} hence our construction yields the desired parsimonious 



reduction. Indeed, observe that, if altering k bits of a binary code increases 
its rating by r, then exactly of those k bits are set to the right value. 



In (2) , altering k = 2 bits of the code in 1(1)1 increases its rating by r = 0, 



hence exactly one of those bits has the right value, which means that 



Sx 7^ Sx in any solution s. Similarly, in |(3)| (respectively, (4)), A: = 5 
(respectively. A; = 3) and r = 1, hence exactly three (respectively, two) of 
the bits set to 1 are correct (cf. the ratings in [Lemma 3 ). □ 



Lemma 5. #3-SAT ^pars #(2)-MSP. 



Proof. We replicate the construction given in the proof of Lemma 4 , but 
we use the proper ratings: Recall that the ratings of MSP are pairs 
of scores (black pegs and white pegs). The first score (black pegs) has 
the same meaning as in MSP-Black, and we maintain these scores un- 
changed from the previous construction. By doing so, we already get the 
desired set of solutions, hence we merely have to show how to fill out the 
remaining scores (white pegs) without losing solutions. 



Referring to the proof of Lemma 4 we change the rating in (1) from 



n to (n, 0), because every in the guess is either correct at the correct 
place, or redundant. 

The rating in |(2)| is changed from n to (n, 2). Indeed, let y be any other 
variable (distinct from x), so that 9y = dy = 0. Then, exactly one between 
gy and gy is a misplaced 0, which can be switched with the misplaced 1 
from either g^ or gx- All the other O's in g are either correct at the correct 
place, or redundant. 

Similarly, the rating in |(3)| (respectively, |(4)[ ) changes from n + 1 to 
(n + 1,4) (respectively, (n + 1,2)). Indeed, exactly two (respectively, one) 
I's are in a wrong position in g. If either gx = 1 or gx = ^ 'is wrong, 
then both gx and gx are wrong and of opposite colors, hence they can 
be switched. Once again, all the other O's in g are either correct at the 
correct place, or redundant. □ 



Proof ( of Theorem 1 ). All the claims easily follow from Lemma 1 Lemma 2 
Lemma 3, Lemma 4, Lemma 5, and the #P-completeness of tt^S-SAT 
under parsimonious reductions [9j. □ 



Example. As an illustration of [Lemma 5 we show how the Boolean 
formula (x V -ly V z) A {-^x V yV w) A {yV -^w) is translated into a set 
of queries for (2)-MSP. For visual convenience, O's and I's are represented 
as white and black circles, respectively. 
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The solutions to both problems are exactly ten, and are listed below. 
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We remark that, in order to determine the values of the auxiUary 
variables Oj, bi and Cj when a solution to the Boolean satisfiability problem 
is given, it is sufficient to check how many literals of Ci are satisfied, 
is true if and only if exactly one literal is satisfied, bi is false if and only 
if all three literals are satisfied, and q is true if and only if aj = 6j. 



4 Related results 



We describe some applications of Theorem 1 to several complexity prob- 
lems. 

Corollary 1. (2)-MSP, (2)-MSP-Black and MSP-White are NP- 
complete. 

Proof. Parsimonious reductions among problems are a fortiori Karp 
reductions among the corresponding NP problems. □ 

So far, we made no assumptions on the queries in our problem in- 
stances, which leads to a more general but somewhat fictitious theory. 
Since in a real game of Mastermind the codebreaker's queries are guaran- 
teed to have at least a solution (i.e., the secret code chosen by the code- 
maker), more often than not the codebreaker is in a position to exploit 
this information to his advantage. However, we show that such informa- 
tion does not make counting problems substantially easier. 

Corollary 2. #(2)-MSP, #(2)-MSP-Black and #MSP-White, with 
the promise that the number of solutions is at least k, are all -complete 
problems under Turing reductions, for every k 1. 

Proof Let ^^Match be the problem of counting the matchings of any 
size in a given graph, which is known to be #P-complete under Turing 
reductions [12]. Let be the problem #(2)-MSP (respectively, #(2)- 
MSP-Black, #MSP-White) restricted to instances with at least k so- 
lutions, and let us show that #Match ^k- Given a graph G, if it 
has fewer than k edges, we can count all the matchings in linear time. 
Otherwise, there must be at least k matchings (each edge e yields at least 
the matching {e}), so we parsimoniously map G into an instance of Hk 
via Theorem 1[ we call an oracle for H^, and output its answer. □ 



The following result, for k = 1, settles an issue concerning the deter- 
mination of MSP instances with unique solution, which was left unsolved 
in [To]. We actually prove more: Even if a solution is given as input, it is 
hard to determine if it is unique. Therefore, not only solving Mastermind 
puzzles is hard, but designing puzzles around a solution is also hard. 



Corollary 3. For every k ^ 1, the problem of deciding if an instance 
o/(2)-MSP, (2)-MSP-Black or MSP-White has strictly more than k 
solutions is 'NP-complete, even ifk solutions are explicitly given as input. 



Proof. Not only do the parsimonious reductions given in Theorem 1 
preserve the number of solutions, but they actually yield an explicit 
polynomial-time computable transformation of solutions (cf. the remark 



at the end of Section 3). Hence, the involved #P-complete problems are 
also ASP-complete as function problems, and their decision fc-ASP coun- 
terparts are accordingly NP-complete |14j . □ 



Remarkably, even if the codebreaker somehow knows that his previous 
queries are sufficient to uniquely determine the solution, he still has a hard 
time finding it. 

Corollary 4. The promise problem of finding the solution to an instance 
o/(2)-MSP, (2)-MSP-Black or MSP-White, when the solution itself 
is known to be unique, is ISiP-hard under randomized Turing reductions. 

Proof. It is known that SAT ^rp USAT, where USAT is the promise 
version of SAT whose input formulas are known to have either zero or 
one satisfying assignments |13j . Let / be the composition of this reduction 
with the parsimonious one from Boolean formulas to instances of (2)- 



MSP (respectively, (2)-MSP-Black, MSP-White) given by |TheoreniT 
Our Turing reduction proceeds as follows: Given a Boolean formula ip, 
compute f{ip) and submit it to an oracle that finds a correct solution 
s of (2)-MSP (respectively, (2)-MSP-Black, MSP-White) when it is 
unique. Then output Yes if and only if s is indeed a solution of /(</?), 
which can be checked in polynomial time. □ 

5 Further research 



In Lemma 1 we showed that 7^(c)-MSP-White is solvable in polynomial 



time when c is a constant, while in Lemma 3 we proved that it becomes 



:^P-complete when c = 2n -|- 1. By making the code polynomially longer 
and filling the extra space with a fresh color, we can easily prove that 
also #(6'(-v/n))-MSP-WHiTE is #P-complete, for every constant k. An 
obvious question arises: What is the lowest order of growth of c(n) such 
that #(c(n))-MSP- White is #P-complete? 

We observed that #MSP is a subproblem of several heuristics aimed 
at optimally guessing the secret code, but is #MSP really inherent in the 
game? Perhaps the hardness of Mastermind is not captured by #MSP 
or even MSP, and there are cleverer, yet unknown, ways to play. 



Problem 2. Mastermind. 

Input: (n, c, Q, k), where (n, c, Q) is an instance of MSP, and A; ^ 0. 
Output: Yes if the codebreaker has a strategy to guess the secret code in 
at most k attempts, using information from Q. No otherwise. 

Notice that Mastermind belongs to PSPACE, due to the polynomial 
upper bound on the length of the optimal strategy given by Chvatal [3] . 
Our question is whether Mastermind is PSPACE-complete. 

To make the game more fun to play for the codemaker, whose role is 
otherwise too passive, we could let him change the secret code at every 
turn, coherently with the ratings of the previous guesses of the code- 
breaker. As a result, nothing changes for the codebreaker, except that he 
may perceive to be quite unlucky with his guesses, but the codemaker's 



game becomes rather interesting: By Corollary 3 , even deciding if he has 



a non-trivial move is NP-complete, but he can potentially force the code- 
breaker to always work in the worst-case scenario, and make him pay for 
his mistakes. We call this variation adaptive Mastermind. 
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